Entry Name:  "UBA-Alcoser-MC2"

VAST Challenge 2014
Mini-Challenge 2

 

 

Team Members:

Jorge Luis Alcoser Flores, Universidad Cesar Vallejo (Peru), jl.tauro.88@gmail.com     Primary
Fredy Hernan Gomez Lopez, Franciso Jose de Caldas Bogota - Colombia, dygomez@gmail.com
     
Miguel Francisco Jarma Forero, Universidad del Magdalena - Colombia, mjarma83@gmail.com      

Student Team:  YES

 

Analytic Tools Used:

Tableau, Excel, Sql Server 2012,Spss,Qgis

 

Approximately how many hours were spent working on this submission in total?

Our team spend 56 hours working on the challenge

 

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2014 is complete? YES

 

 

Video:

 

Youtube: UBA-Alcoser-MC2_Video

Download: UBA-Alcoser-MC2_Video

 

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Questions

 

MC2.1Describe common daily routines for GAStech employees. What does a day in the life of a typical GAStech employee look like?  Please limit your response to no more than five images and 300 words.

 

To find the daily routines for the Gastech employees we make a correlation analysis and find the next results

 



 

MC2.2Identify up to twelve unusual events or patterns that you see in the data. If you identify more than twelve patterns during your analysis, focus your answer on the patterns you consider to be most important for further investigation to help find the missing staff members. For each pattern or event you identify, describe

Evente No.1

Evente No.2

Evente No.3

 

 

 

MC2.3Like most datasets, the data you were provided is imperfect, with possible issues such as missing data, conflicting data, data of varying resolutions, outliers, or other kinds of confusing data.  Considering MC2 data is primarily spatiotemporal, describe how you identified and addressed the uncertainties and conflicts inherent in this data to reach your conclusions in questions MC2.1 and MC2.2.  Please limit your response to no more than five images and 300 words.

 

The problems we encounter when analyzing the data were:

GPS data was reduced by taking only the data of latitude and longitude, reducing it to the minimum length in minute and hour specified. We unify the Transactions of LOYALTY_DATA and CC_DATA tables with records that were identified as the same transaction and creating a variable for whether loyalty card is presented when they made the transaction. Other thing we do to reduce the data was a join between the tables cars_assigment and credit_card transaction to see the car location of the different employees, with their transactions, this was done by first name and last name, and hour of the transactions.

To complement the missing values of the varibles CURRENTEMPLOYMENTTYPE, CURRENTEMPLOYMENTTITLE we used employees records file of the minichallenge1

After performing a deeper analysis of the data we found an outlier which gives us a step to think about doing other types of analysis

We made a join between the transaction date and the GPS date, to have an specific idea of a record in a given time, gps location, location, employee and Price.